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ABSTRACT 


In  the  problem  under  consideration  a decision  maker  has  a 
total  of  M candidates  to  interview  sequentially.  The  de- 
cision maker  must  either  accept  or  reject  the  candidate  being 
interviewed  after  he  has  been  ranked  with  respect  to  his 
predecessors.  Once  rejected  a candidate  cannot  be  reconsid- 
ered; once  a candidate  is  accepted  no  further  interviews  are 
carried  out.  The  objective  is  to  select  the  candidate  in 
such  a way  as  to  maximize  the  probability  of  choosing  the 
best  of  all  M candidates  (assuming  every  ordering  of  inter- 
views is  equally  likely).  In  this  paper  we  allow  M to  be  a 
random  variable  and  determine  sufficient  conditions  on  its 
distribution  so  as  to  obtain  an  optimal  policy  of  simple  form. 
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ON  THE  CANDIDATE  PROBLEM  WITH  A RANDOM  NUMBER  OF  CANDIDATES- 


by 

C.  Derman,  Columbia  University 
G.J.  Lieberman,  Stanford  University 
S.M.  Ross,  University  of  California,  Berkeley 


1.  Introduction 

In  the  problem  under  consideration  a decision  maker  has  a 
total  of  M candidates  to  interview  sequentially.  The  decision 
maker  must  either  accept  or  reject  the  candidate  being  interviewed 
after  he  has  been  ranked  with  respect  to  his  predecessors.  Once 
rejected  a candidate  cannot  be  reconsidered;  once  a candidate  is 
accepted  no  further  interviews  are  carried  out.  The  objective  is 
to  select  the  candidate  in  such  a way  as  to  maximize  the  probability 
of  choosing  the  best  of  all  M candidates  (assuming  every  ordering 
of  interviews  is  equally  likely) . 

In  the  classical  version,  M = m is  fixed  and  the  optimal 
selection  policy  is  to  interview  r - 1 candidates  without  choosing 
any;  then  select  the  first  leading  candidate  interviewed  thereafter. 
Here 


r *= 


r*(m) 


min 


m-1 

r >.  l|  l l/k<  1 
k*=r 
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University  of  California,  by  the  Air  Force  Office  of  Scientific 
Research  (AFSC),  USAF,  under  Grant  AFOSR-77-3213  with  the  University 
of  California,  and  by  the  National  Science  Foundation  under  Grant 
MCS-7825-146  with  Columbia  University. 
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(See  Chow,  et.al.  [1]  p.  51  or  Derman  [3]  p.  118.) 

Rasmussen  [5]  and  Rasmussen  and  Robbins  [6]  generalize  the 
problem  to  the  case  where  M is  random  with  known  probability  function 
P{M  = m)  = p^,  m = 1,  . N,  N < ®.  The  approach  taken  in  [6]  rests 
on  the  presumption  (asserted  in  [5])  that  the  optimal  solution  policy 
possesses  the  above  simple  form  and  that  one  need  only  find  the  value 
of  r that  maximizes 


<P{  r) 


N 

l Pm/m’ 

m=l 

N m-1 

(r-D  I Pm/m  1 1/k, 

ra=r  k=r-l 


if  r = 1 

if  r > 2 , 


the  a priori  probability  of  selecting  the  best  candidate  under  the 
rule  which  lets  the  first  r - 1 go  by  and  then  selects  the  first 
leader  that  occurs  thereafter. 

The  assertion  of  [5],  however,  is  not  true  for  every  p'  bability 

function  {p  }.  In  Section  2 a counter-example  shows  this.  In  Section 
m 

3 a sufficient  condition  is  given  for  the  optimal  policy  to  have  the 
simple  form.  The  device  for  generating  the  condition  is  the  "one-step 
look  ahead"  criterion  given  by  Derman  and  Sacks  [4]  (also  by  Chow  and 
Robbins  [2]).  It  turns  out  that  this  criterion  applies  if  and  only  if 
(p{ r)  is  unimodal.  The  condition  given  here  for  the  criterion 
to  apply  improves  on  the  one  given  in  [6]  for  </>(r)  to  be  unimodal. 


2 


2.  Counter-example 


If  {p  ) were  a two-point  distribution  with  large  probability 
m 

mass  at  M = 2 (say)  and  small,  but,  positive,  probability  mass  at 

M = N(N  large),  a simple  selection  policy  would  have  r = 1 or  2. 
Intuitively,  however,  it  would  seem  that  a better  policy  would  be  to 

select  the  second  candidate  if  he  was  better  than  the  first  and  if 
not  to  let  a number  of  candidates  go  by  before  deciding  to  choose 
the  next  leading  candidate.  Formally,  suppose 


p2  = 1 " C’  PN  = E ' 


Then 


<p(  1)  - e/N 

N-l 

<P( 2)  - + | l l/k 

1 k=l 

(p{ r)  < e,  3 1 r < N . 

So  for  N > 2 and  E small 


<p( 2)  = max  <p( r) 

r 
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However,  consider  the  policy  that  stops  at  r = 2,  if  possible, 
or,  if  M = N,  at  the  first  leading  candidate  after  i - 1 candidates 
have  been  interviewed,  where  i shall  be  appropriately  determined.  Let 
<P  denote  the  probability  of  selecting  the  best  candidate  under  this 
policy.  Then, 


< p = P{M  = 2)  P{Selecting  Best  Candidate |m  = 2} 


+ P{M  = N}  P{Selecting  Best  CandidatejM  = N) 


1-e 


+ e 


1/2  • 2/N  + 1/2  • Y 1A 

\ * k=i-3 


1-e 


+ e/N 


1 + 


i-3 


N-3 


l 1/k 


k*i-3 


Thus, 


(p  > ip(2) 


if 


i + l i/k  > l i/k  . 


N-3 


N-l 


k=i-3 


k=l 
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If  i is  any  integer  greater  than  5 and  N is  sufficiently  large, 
the  latter  inequality  will  hold. 


3.  One-step  Look  Ahead  Policies 

Consistent  with  our  use  of  the  expression  in  the  foregoing,  we 
say  a leading  candidate  is  being  interviewed  if  he  ranks  above  all 
previously  interviewed  candidates . A one-step  look  ahead  policy  is  one 
that  accepts  a leading  candidate  if  and  only  if  the  probability  that  he 
is  the  best  exceeds  the  probability  that  one  more  leader  will  be  inter- 
viewed and  that  he  will  be  the  best.  Applying  a theorem  by  Derman  and 
Sacks  [4]  (also  Chow  and  Robbins  [2]),  the  one-step  look  ahead  criterion 
yields  an  optimal  selection  policy  if  there  is  an  r - 1 such  that 
one-step  look  ahead  accepts  (rejects)  a leader  among  the  first  j 
interviewed  if  j > r - l(j  <_  r - 1). 

Let 


p(ra|  i) 


m > i , 


denote  the 


denote 
^ denote 


conditional  probability  that  M = m given  M ^ i.  Let 
the  event  that  the  jtl1  candidate  interviewed  is  a leader, 
the  event  that  the  j^  candidate  is  the  best.  Then 


(say)  . 


N 

«f)-j  l p(mh)  = R(j) 

J m=j  m 


. th 


Let  <&  denote  the  event  that  the  j candidate  observed  is  a leader, 

one  more  leader  will  be  observed  and  that  he  will  be  the  best.  Then 


m 


P{^ 


UM>j,  <?}=  I p(ra|  j)  [ __j Jc 

J J ra=j+l  k=j+l  k(k-l)  m 


N , . m-1 

3 I l 1/k 

m=j+l  k=j 


= S(j)  (say) 


Thus,  the  one-step  look  policy  is  optimal  if  for  some  r - 1 


(1) 


R(j)  - S(j)  < 0,  j < r - 1 
> 0,  j > r 


j-1 

Now,  interpreting  [ 1/k  **  0, 

k-j 


R(j)  - S(j)  = J 


! T 

m=j  m=  j+1  “ k=j 


N 


m«J 


N p / m-1  \ 

I vf1-  I i/») 

n-j  \ k-j  / 


rm 
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i 


*(j) 


m=j 


m 


where  ^(j)  is  the  expression  within  the  last  brackets.  The  sign  of 
R(j)  “ S(j)  is  the  same  as  that  of  <Kj).  Thus,  from  (1)  follows 

Proposition  1;  A one-step  look  ahead  policy  is  optimal  if  for 
some  r - 1 


(2)  *<j)  < 0,  j = 1,  ....  r - 1 

*^(j)  > 0»  j = •••»  ^ • 

The  hypothesis  of  proposition  1 is  equivalent  to  <p( r)  being 
unimodal  since,  as  given  in  [6], 


<p(j)  - = 'l'(j).  j = 1 


Now  let 


N 

H,  **  p.  “ l P /m,  j = 1,  ....  N - 1 
3 3 m=j+l 


In  terms  of  we  state 


Condition 


(*):  > 0 implies  _>  0 for  i > j . 
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We  now  have 


Theorem  1:  If  condition  (*)  holds,  then  there  is  an  r such  that 
(2)  holds  (i.e.,  a one-stop  look  ahead  policy  is  optimal). 

Proof : In  Rasmussen  and  Robbins  [6]  it  is  given  that 

iKj)  - *(j+l)  = H /j,  j = 1.  ....  N - 1 . 

Let  i be  any  integer  such  that 

i|>(i)  >_  0 . 


If  H.  < 0,  then 

l 


ij/(i+l)  > iKi) 

1 0 • 

If  0,  then  by  condition  (*) 

i »•  i ‘ 1 + 1 “ - L' 

and  hence,  is  non-increasing  in  j for  all  j,  i ± j £ N.  However 

<l>( N)  > 0; 


n 
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therefore,  ;Kj)  1 0 for  all  j,  i < j < N.  Thus,  in  both  cases 
ij)(i+l)  >_  0,  proving  the  theorem. 

Remarks : Condition  (*)  is  weaker  than  the  condition  £ ?2  £ **’  — 
given  in  [6].  It  is  also  satisfied  by  the  geometric  distribution  over 
1,  N,  a distribution  for  which  (p(r)  is  uniraodal  (as  stated  in  [6]). 

It  is  not  necessary  to  restrict  N to  being  finite;  under  condition  (*) 
Theorem  1 will  hold  with  N = °°  since,  in  this  case,  lim  ifi(j)  =0  . 
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